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DISK BASED DISK CACHE INTERFACING 
SYSTEM AND METHOD 

CROSS-REFERENCE TO RELATED 
APPLICATIONS 

Commonly-assigned U.S. patent applications Ser. No. 
08/561,337, filed Nov. 21, 1995 entitled "Improved Fault 
Tolerant Controller System and Method" by W. A. Bram, M. 
E. Niclson an G. Howard; Scr. No. 08/363,132 entitled *'A 
Fault Tolerant Memory System" by G. Neben. W. A. Brant 
and M. E. NieUon; and Scr. No. 08/363,655 entitled 
•'Method and Apparatus for Fault Tolerant Fast Writes 
Through Buffer Dumping" by W- A. Brant, G. Nsben, M. E. 
Nielson and D. C. Siallmo (a continuation-in-part applica- 
tion of U.S. Ser. Nfo. OS/112,791 by Brant and Stallmo which 
is itself a contmuauon-in-part of application Set. No. 638, 
167 filed Jan. 6, 1991 by Brant, Stallmo, Walker and Lui the 
latter of which is now VS. Pat. No. 5,274,799) describe fault 
tolerant data processing apparatus and processes which 20 
interface between hosts and data storing subsystems. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates to data processing structures 
and procedures which provide interfacing between computer 
type systems and low cost but slow performing mass data 
storage devices or subsystems. More particularly, the present 
invention relates to digital data handling structures and 
methods which optima cost/performance interfacing 
between host systems which exchange data with relatively 
inexpensive high capacity data storage devices. While not 
necessarily so limited, the present invention is especially 
useful in systems having a dynamic ability to overcome 
errors resulting from various sources during transmission of 35 
digital data between computer oriented completes. 

2. Description of the Related Art 

The costs of manufacturing disk drives as a media for 
storing digital data has reduced dramatically in recent years. 
It has far outpaced the cost reductions associated with solid 
state and other data storage devices. Smaller form factor disk 
drives permit disk subsystems to exploit the performance 
advantages of having more disk drives to service requests in 
parallel. 

The components of disk subsystems have also become 
significantly more reliable in recent years. Mean Time 
Between Failure (MTBF) ratings have progressed from 
15,000 hours in 1988 to 500,000-800,000 hours for both 
disk drives and power supplies. The MTBF for controller so 
electronics continues to improve with integration and lower 
power requirements. 

The personal computer (PQ technology has become ever 
more popular for use as major elements in data processing 
and handling systems and subsystems. System configura- 55 
tions known as RAID (Redundant Array of Inexpensive 
Disks) computers have likewise evolved. 

Caching controllers that interface with host computers or 
the like for directing data exchanges with large arrays of 
magnetic data storing disks Or other storage media, have 
developed for providing a storage medium for large quan- 
tities of digital information. These controllers respond to 
read and write commands from a remote computer system to 
receive, and/or deliver data over interconnecting busses. 
They often employ expensive solid state storage, such as « 
RAM, to cache host data u> minimize the relatively long 
latency of the disk subsystem. 



AQ 



45 



60 



Preferably, the caching controller should function s o that 



it minimizes delays and demands on the host system, 
including the ability to recover wherever possible* 



errors from single points of failure. System configur rtions 
and operations capable of dynamically overcoming angle 
points of failure are sometimes referred to as fault t< ierant 
systems. The aforementioned related patent applications 
describe such redundant fault tolerant systems and itera- 
tions in a disk array controller environment. 

It is known to improve storage system reliability by 
redundant recordings of the same data on duplicate t ,pc or 
disk drive systems. This allows improved reliability ijj that 
the data is still available from the back-up recording in the 
event of failure of the primary drive, Furthermore, the 
redundancy helps to somewhat reduce the amount c 1 time 
required to read that data in that the controller car issue 
concurrent read commands to both of the redundant & orage 
devices. The data is selected from the drive whicjji first 
produces it. 

The cache controller avoids wait time by thi|j host 
computer, or central processor, in reading or writing r ! lative 
to a disk by buffering write data into a protects I fast 



memory, and servicing most read data from fast mere 



system, as described in the cross-referenced cop<jpding 
patent application, can include redundant storage ncdia 
array controllers for responding to host computer requests 
for transferring data between that host computer t id an 
arrangement for low cost but large quantity data storage 

In "Computer Architecture, A Quantitative Approa ; h" by 
D. A. Paxterson and J. L. Hennessey (Morgan Kaulmann 
Publishers, Inc., Second Edition, 1990, 1996), dis;usses 
processor memory, or RAM, and how it is eacl ;!d. It 
describes the disciplines, such as direct map, set assoc ativc, 
and the like, 



SUMMARY OF THE INVENTION 



aj 'ay 



The present invention advantageously utilizes an 
relatively slow devices, such as low cost disk drives^ . 
the data replicated across those devices so as to ci 
probability that one of the read/write heads is 
close to the desired data upon the occurrence of 
command. The controller renders a logical decision) 
which drive can produce the desired data with 
delay. 

The present invention deliberately uses mapping 
caching techniques to replicate data across a multipli 
disk drives to substantially enhance the ability to 
recover the data. The data is replicated across the drivi 
replications of the data on the same drive is 
possible, thereby minimizing the time for a rcad/wrii{ 
to encounter the desired data. Thus, inexpensive 
improves the speed of data recovery. 

The controller optimizes the seek and rotational 1 
times. A given controller can concurrently service a pi 
of data recovery operations. The mere presence of 
ocular data exchange request as from the host is not 
sarily the next item that will receive execution. Th 
troller determines which service requests are queued 
of a given request, and further projects where each 
read/write heads of the disk drives will reside at lha 
Thus, the controller can determine which disk drive 
will actually recover the requested data well in advaji 
its actually producing that result. 

A series of products based on a single hardware _ 
further leverages the value of using contemporary, c 
shelf components. A disk based disk cache, a high 
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mancc simplex and/or duplex mirrored disk subsystem, and exchange controlling commands from a remote co muter, 
a low maintenance mass storage unit 'using RAIDS are These commands are utilized in conjunction with >a fast 
possible. RAID3 configurations can be configured with memory to buffer data accompanying such COriinands 

hardware accelerators. between the computer and at least one satellite data : Jorage 
> The most economical disk drives have found cominuing 5 *nil, sucb as an array of disk drives. The memory ooi ,croller 

favor in high volume applications, such as PCs and work- lDclu ^ s t lcmcDts whose primary purpos,| is for 
stations. Leveraging the cost performance of these disk ^ievm £ daia from the fast memory for transfer to ar output 

drives into high read/write performance applications rmutai, ^ \ 

requires packaging large numbers of disks coupled with a ^%Tl * * ^ 7** * ve T* P ^ 

contmLertoiTnage The complexity of distributing data 10 ^ r ^ i data prc<:essmg devxee .and a 

among all the drives, and the use of data parity codes and *>* ^ nQ S lar f quantities of data. Aeon* >Uer is 

^ r ^.71^1 7xiTDc coupled for exchanging data and commands with tt ? host, 

data smping to mitigate the decreased MTBE and with the subsystem. A plurality of low cost dv ks are 

As disk drives become smaller, the performance advan- coupled to this controller. The controller responds t:j com- 

tages of using very large numbers of disks becomes feasible mands from the host to read data from an address iin the 

since space requirements are small, and power and control a5 subsystem by inspecting the contents of the array of It: w cost 

distribution is over small distances. As the cost per megabyte disks to identify the presence of data corresponding ; !to the 

(MB) falls, it becomes less expensive to increase disk address from the host. The controller transfers the da :a from 

subsystem performance by storing the data on multiple the hosi specified daia address from the disk to the ha \ if the 

drives instead of using expensive controllers that distribute data is found to reside there. 

and reconstruct data with parity, such as in RAIDS systems. 20 This controller can also function upon a failure b find 

With regard to disk based disk cache management daia in the disk array corresponding to the host specified 

strategies, the mechanical characteristics and me size of the address by recovering the data from the subsystem f , jr both 

disk based disk Cache suggest special constraints for man- transfer to the host and storage in a plurality of disk; ! in the 

aging the disk based disk cache that would not apply to „ ™* The controUer can 4so sense the disk hav pg its 

conventional caching mechanisms. Different management * gjjgj £ S33£^ * ? 

strategies will have varying impact on complexity/ r __ . ,, . , . ... > . . 

performance measures. ^ cootroller can further determine the nine requ yd for 

accommodating any previously occurring operations with 

The oore technologies of fault tolerant subsystem design ^ disk stray before determining which disk has * said 

and high transition processing capabilities support new head closest to the data associated with a given read jjquest 

architectures aod platforms to create product differentiation ft 0ta t ^ 6 ^osi 

through performance. High performance subsystems have ^ method of ^ iDvenUoQ utflj2es ?f , 

commande4a 0 dw l Uconnnue to command, highermargins cost dkks for iwer focing between a host data pressing 

*an those based on contemporary features and capacity. dcvicc a ^system capable of storing Urge qnani ties of 

I other performance driven manufaenwers of all x ^ ^ me0 ^ ^ * of * VTJ data 

^^J^^LX^ m £ exchanging commands from the host and respon ng to 

performance flagship product. commands from the host to read data from an addrcs; in the 

The subsystem in accordance with this invention exploits subsystem by inspecting the contents of the disks in the 

the cost and high reliability characteristics of high volume, This inspect jg to determine whether it s. data 

widely available small form factor disks drives and standard 40 corresponding to the address is present in at least om of the 

processor platforms. The system incorporates components disks 

of such high reliability or sufficient spares^ sue h that hot ■ Data ^ mc ^ ^ - m via&Uwd (t0m of 

pluggable Customer Replaceable Units (CRU's) are not me ^ ^ ^ ^ i„ response to a favorable resn I from 

necessary to ensure adequate servace hfe. Theprwentmven- me Opening stcp . ConvVrseiy, a failure to produce a 

^SS^S^V^f^f tJ 01 4S positive^resultWtbe inspecting Step results in the , epsof 

srnMler) disk drives^ as well as IBM PC, Mips PC, Or Dec reweriag tbc da ta from the subsystem, transferring that 

Alpha PC compatible motherboards. It can employ pur- recovcrod data to the requesting host, and storix | that 

chas^mulataskmg operating systems and standard power rec0 vered data in a plurality of dfeks in the array. • 

%*? . ... . , . ~_ Once presence of the specified data 00 the disks of the 

Tbe advantages of tins approach are marnfoW. Tbese so is ^determined, the disk having its rcad/writT head 

include exploitation of contemporary core technologies (not q1q ^ [q ^ dm ated b ^tat & L d (he 

mere implementations), dual controller/fault tolerant storage data fa duce<J ^ h y ^ ^ sdecitd £ £ If 

suborns env^runeotavconfiguxaoon sense and control, ncccs ^ ^ mcthod of inchldes the ?p of 

controllers based on PC technology, development tools that determiniag time required for accommodating 

are numerous^ cheap, and sophisncaied, volumes and com- JS ^ ^ ^ thc ^ prio r to p t 

peutu« which jomdy keep the cost per MIP very low, tod mg ^ disk selecting step. 

party development of faster controllers, third party interf ace r ...„ . . ,„ A1 _ 

hardware and driver software availabihty, and tfiird pany f ^ osc ^rmal sWJ in tbc art w,n recogn, f e the 

oneratinc systems foregomg and other objects, features, advantages an<; 

V m. . « ^ * cations of the present invention from thc following ' 

Hie present invention is concerned with controllers that 60 detailecI description of the preferred embodiments a 

execute data exchanges between requesting host computers lratcd ^ thc accompanying drawings, 
and one or more satellite data storage units associated with 

that controller. Thc present invention can accommodate BRIEF DESCRIPTION OF THE DRAWING^ 

implementation in conjunction with a caching disk array • FIG. I is a system block diagram of a disk base 

controller in a fault tolerant disk array subsystem. <$5 array coupled to perform mierfacdng between a hos 

A controller in which one feature of the present invennon mass daia storage subsystem in accordance with the ] resent 

is useful has a plurality of elements arranged to receive data invention. 
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FIG. 2 shows a typical circuit board module configuration trollcr 20 module includes a processor chip with its issod- 
implcmcnting the present invention. at ed high speed registers and buffer memory. 

FIG. 3 is a physical depiction of a typical disk mounting The system shown in FIG. 1 is an improveme \ over 

board suitable tor attachment to a motherboard shown in either directly coupling host 10 to the high volurj c data 
FIG . 2. 5 storage subsystem 25, or using a solid state type devic i, such 

FIG. 4 is block diagram of a logical depiction of the disk 33 cacnia S though use of a HAM for such interfacing . As to 
subsystem in accordance with the present invention. the fo *»ef situation, the improvement is in that it]lgnifi- 

cantly onloads the host computer from dedicated did ^ man- 
DETAILED DESCRIPTION OF THE agemerrt tusks. With regard to the latter, the high cost f solid 

PREFERRED EMBODIMENTS 10 s^tc interfacing devices is significantly avoided by } large 

margin, while providing an acceptable compromise in 
The storage hierarchy associated with the various con- approximating the operating speed: thereof, 
temporary storage conngurations appears as follows in order Array 22 as shown in FIG. 2 includes a printed [circuit 
of highest cost but fastest performance first and lowest cost (PC) board 50. The base 51 of board 50 is aixac icd to 
but slowest performance last: 35 motherboard 40 via connector 52 shown in FIG. 3. Si ; small 

#) are 
p. and 
m are 



1. The registers of a Microprocessor; disk drives 53-58 (1,8-inch disk drives, for examr 

2. An on-chip cache; attached to base 51 of PC board 50. The host interface 

3. An off-chip cache- xhc interface 24 to the mass data storage subsysv 
4 Main mernnrv- ' likewise connected as boards to motherboard 50. 

' "TT ™ cmor y» 20 FIG. 4 presents the logical diagram of the disc 

5. I/O buffers; 53-58 which communicate in pairs through sianda 

6. A solid state disk cache; drive interfaces 61, 62 and 63. The IDE standarr. 



8. A high performance disk; suffice for interfaces Thus, disks 53-58 

9 A hieh caoacitv disk: 25 disJcs of me based ^ ***** interfacing system 

, * <s, t • T ^* J . . . olluir d ^ shown in array 22 might actually prov 

10. Slower high capacity disks; and disk storage devices for disk subsystem 25 possibl) 



pVives 
fci disk 
for a 

7. A disk based disk cache; PC-style jhsk drive interface handles pairs of driv&, And 

; , jre the 
while 
jde the 
under 

U. Optical, tape and/or library. the supervision of another controller (not si own). 

A storage subsystem that has the MB cost of disk coupled Conversely, an entirely separate high capacity data ji forage 
with the performance of many disks operated in parallel can 30 device can independently provide the subsystem 2i\ func- 
fill several intermediate slots in this hierarchy. Spin syn- lions, 
chronizing of the disks can help keep disk access times Internal data bus 59 can take the form of EISA, VJ SAor 

relatively low. Further down the hierarchy, inexpensive PCI types of standard disk drive interfaces. Application 
controllers coupled to an array in RAlDl conngurations can Specific Integrated Circuit (ASIC) 60 functions in re ponse 
yield high I/O rates. 35 to commanding signals from the controller 20 to acco xplLsh 

Still further down the hierarchy, RAIDS configurations actual data exchanges with 53-58. ASIC 60 sub- 
reduce the cost of protected storage with small redundancy stantially subservient to controller 20 to address } disk 
groups and higher capacity disk (more data under each disk drives for storing or retrieving data, 
head). Even higher capacity, but slower, configurations of Accordingly, host 10 interfaces with the disk ba$< J disk 
protected data can be built with high capacity drives and 40 cache on board 40. Small Compuxer System In jrf acc 
large redundancy groups; i.e., 35+1. All these products can (SCSI) type connections arc well suited for host interf ce 11, 
be based on a common purchased hardware platform and as well as interface 24. Both cpmmahds and data to an : I from 
contemporary developed ASIC based daughter cards, the host computers 10 arc handled via connections U all of 
RAIDS is supported with daughter cards with flow through which can occur concurrently. Through local busses local 
parity generators. as processor 20 strips host 10 originated command signal i from 

The preferred embodirncnts of the present invention are the interfaces to direct the operation of its fast memory 
herein described in the environment of a data processing elements and disk array 22 in handling the data associated 
system, including at least one remote host computer, such as with those commands. 

host 10 in FIG. 1. Host 10 is in a relatively remote location Controller 20 determines t}ie allocation Of low co& disk 
with regard to disk subsystems responsive to interface 50 array 22 in transferring data across interface 24 for js orage 
exchanging local controller 20. Host 10 is in bidirectional in low cost (but low speed) disk subsystem 25. iyj jpally, 
communication with controller 20 of a disk based disk cache subsystem 25 includes its own controller to handl! data 
subsystem 16 via link U. exchanges coordinated with controller 20. A typical bi Jard is 

Controller 20 employs its own buffer to interface with a shown in FIG. 2 wherein interfacing modules 11 and 
separate disk based disk Cache array 22 via bidirectional 55 coupled to cooperate with controller 20. Array 22 is m 
connections, such as connector 15. Fully redundant data a multiplicity of small disk drives which Can encc 
paths interconnect the host with the controllers, as well as both the disk cache amy 50 and, if desired, the disks o] 
between the controllers. Controller 20 can include indepen- storage system 25 in FIG. 1. 

dent paths to write data to its memory in a mirrored fashion A variety of possible disk based disk cache manag 
so mat data is protected against loss. $q strategics are possible. The mechanical characteristic $ and 

FIGS. 1 and 2 illustrate the use of a plurality of disk drives the size of the disk based d fcV cache suggest special 
22 With FIG. 2 presenting the physical top view of a board straints for managing the disk based disk cache that i 
implemented in accordance with the present invention. not apply to conventional caching mechanisms. Di jerent 
Floppy disk drive 45 is included for introducing programs management strategics will have varying impaU on 
into the controllers and storage units of board 40. Power 65 complexity/performance measures. The following oi tfincs 
supply 46 renders board 40 self-contained, while battery 48 several strategies potentially available for applicat 
provides temporary back-up power for board 40. The con- managing disk based cache systems. 
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A direct mapped strategy uses a simple function, such as passed through to subsystem 25 for storage. The (ata is 

a modulo operation, to map Logical blocks to a physical likewise placed in a cache SO location at the I cation 

location. A direct mapped strategy always writes a specific specified by the lower sixteeo-biis of the address fix n host 

logical block to the same physical location. Therefore, the 10. The controller 20 might place the data on all of lh< drives 

search entails a constant rime segment. 5 associated with cache 50. A subsequent request for a] ead of 

A typical direct mapped write operation writes N copies that same data by host 10 results in controller 20 deter i pining 

of the data to the disk based disk cache comprised of N disks where the request resides in the queue of operatioj & and 

with one copy to each disk. In fault tolerant systems, the host which read/write bead of one of the drives on boar I 50 is 

may be signalled that the write operation is complete as soon closest to that data. Controller 20 knows where each i (isk.of 

as two of the N writes are completed. The logical write 30 the cache is in its rotation. 

operation need not wait for the completion of all physical Three operations involve rime in association with in I/O 

operations. function. The first relates to the wait in a queue of ot er I/O 

Unlike write operations, direct mapped read operations operation requests ahead of the request under consid* ration, 

only need to access one device within the disk based disk The next time increment is the latency time which stems 

cache. Therefore, the cache management strategy should is from the time it takes a read/write bead to acm [re an 

select the device which can most quickly access the data. appropriate position over the data. The final rime ine emem 

The factors that affect device access time are seek and is the transfer time associated with acquisition of tup data 

rotational latency of each candidate drive, and the time from the disk, and actual transfer to the requesting ost or 

necessary to complete previously queued operations for that other device. The queuing and latency times are minSmized 

drive. The cache management strategy minimizes device 20 by the processor of controller 20. 

queuing by selecting the device which can most quickly In operation, the processor of controller 20 in accoi dance 

access the data. Device rotational synchronization can mini- with die present invention responds to a read requcj , from 

mize device latency. Devices are preferably synchronized so host 10 by inspecting the contents of its associate \\ disk 

that the rotation of each device is offset by 1/N (where N is cache. If the address of the requested data fails to ma 1 ph the 

the number of devices in tbe system). 25 disk cache stored dara, a decision is made to pass tlio data 

lo direct mapping, the drives of array 22 can be consid- read request directly through \o the mass data storage 1 [evice 

ertd to represent a linear set of logical data storage addresses 25. The data acquisition is accomplished with substo ilially 

with each sector stacked on top of the other. If, for example, the same time increment as would occur if host H were 

mere are twenty drives, each with twenty addressable directly coupled lo storage subsystem 25. 

sectors, there are four-hundred addressable sectors. The 30 However, when a match does occur, controller 20 ecog- 

access time for all those sectors is not the same. Ii depends nizes that ii is possible to rctriove that data from the cache 

upon die rotational latency and the seek latency of each storage, and return it to host }0 at a considerably fas* *r rate 

individual drive with regard to the actual location of the than if the request were directjy transferred to subsyst m 25. 

read/write head. The use of a plurality of low cost disk drives for the pacbe 

The number of bytes of data retainable by array 22 is 35 with common data stored thereon and the recogninpn of 

markedly smaller than subsystem 25 can Store. Direct map which disk has its tcad/write head closest to that data r ; sders 

is a relatively simple algorithm. A certain number of bits it possible to rapidly acquire and transfer the daw} p the 

describe the address of data in store 25, but it is possible to requesting host 10. 

ignore the top of the address as this identifies the location of Write commands from the host 10 are imme< iately 

that byte in storage array 22. For instance, if store 22 has 40 recognized, and promptly passed to storage subsyst m 25 

65,000 locations with sixteen bits of address while Store 25 without delay. 

has 4 gigabytes located in ihirty-two address bits, a byte in Another disk based disk cache management stratqj y is a 

store 22 shares addresses with 65K locations of store 25. set associative strategy. Set association uses a simpk func- 

Store 22 is selected lo have enough capacity to hold the most tion (like a modulo operation) to map a logical block a : |o*ress 

important data that is used frequently by host 10. Data which 45 to a table entry. Each table entry contains a fixed nun >cr of 

becomes unimportant is replaced by some other dara that has physical locations in which the logical block mi|;(ii be 

acquired importance in the system operation. located. This approach helps alleviate thrashing that ; jpme- 

Host 10 specifies a particular address associated with the times occurs for some workloads with a direct itj ippcd 

subsystem 25 storage. Hie lower segment of the address is strategy. 

used by a straightforward algorithm that allows the control- 50 Multiple variations of set association are possible dc ; pend- 

ler 20 to map into the disk cache. When host 10 requests a iog upon the eviction algorithm used to manage the table 

read operation from a ihirty-two bit address, the disk cache entries. For example, the logical data within the tabh entry 

controller 20 looks at the lower sixteen bits. Controller 20 can be evicted in a random, revolving, least rccontJ> used 

retrieves the byte in conjunction with a tag or page frame (LRU) or fastest fit algorithm. The optimum table entry 

number which completes identification of the location 55 management approach depends upon the cache work I pad. 

address with respect to the storage in subsystem 25. A fully associative LRU cache management stijuegy 

Controller 20 recognizes that a match of the upper sixteen allows any logical data block to be mapped to any pb fsical 

address bits represents data contained at the address speci- location. With a true LRU, logical write operations cvift the 

fied by the lower sixteen bits. Controller 20 thus recognizes least recently used logical data block. Logical read 1 : pcra- 

ihai the data is in the cache memory 50, and therefore can 60 tions use an index mechanism^ such as a hash table, 1 > find 

produce that data to host 10 with minimum delay. A failure the physical location of the logical data. This app roach 

to match means that access to storage subsystem 25 is favors cache workloads with a high locality of rcfcr< ice. 

needed. However, the retrieved data is stored in cache 50, The fully associative, fastest fit cache managemeni strat- 

and also transferred to host 10. cgy evicts the closest possible physical location in oti(er to 

In the exemplary embodiment herein described, the disk 65 optimize the write operation. Due to read operation jntcr- 

53-58 array on board 50 is designated as the disk based disk spersed between the write operations, logical write < pera- 

cachc. On a write operation specified by host 10, the data is tions will cause each physical disk 'to evict differed plat* 
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Therefore, ihe fastest lit strategy is not a mirroring approach. said array of disk drive storage devices utilizing 

Writes may be optimized at the expense of slower reads or recently uscd/f astcst fir cache management a ; rategy 

even cache misses. Reads require an index mechanism, such aod means responsive to host read data commi Ids for 

as a hash table, to find the physical location of the logical inspecting the contents of said disks to delnmnc 

a Aix)mer disk based disk cache management strategy is the 5 whether the data corresponding to said add 

fully associative, hybrid LRU/fastest fit strategy. Instead of P rcscnt ther6m ' said COnlroUcr ftmhcr ^eluding 

evicting the least recently used data item, a hybrid system responsive to a favorable result from said ins .ecting 

searches the N least recently used entries to find the fastest means for transferring said address corrcspondii ig data 

fit. The strategy offers some write performance without from said disk to the host. 

sacrificing cache effectiveness. The meta data required by 2 6. Apparatus in accordance with claim 5 where n said 

this strategy is the same as the LRU strategy, coupling means includes an ASIC module. 

While the exemplary preferred embodiment of the 7. Apparatus in accordance with claim 5 wherein si itf disk 

present invention are described herein with particularity, cache includes a plurality of disk drives mounted on * circuit 

those having normal skill in the art will recognize various board 

changes, modifications, additions and applications other 15 o 1 ^ . . . . . „ . . . , , , 

than those specifically mentioned herein without departing M *^*7< m * c ™^™™*<**** ^ « 1 ?™« 

from.the spirit Of this invention. a womerboard for receiving said disk d* ve circuit bo rdand 

What is claimed is: having said processor mounted thereon. 

1. Apparatus for interfacing between a host data process- 9 ' A PPa™^ in accordance with, claim 8 which ill eludes 
ing device and a subsystem for storing large quantities of 20 a disk dnvc for receiving separate disks mounted <: n said 
data comprising: ^ ^ ^ motherboard for Selectively introducing programs io said 

a controller coupled for exchanging data and commands processor. 

with the host and with the subsystem; 10, Apparatus in accordance with claim 9 which ii eludes 

a multidimensional array of disk drive' storage devices a plurality of said disk drive circuit boards attached o said 

coupled to said controller; and 25 motherboard, said controller including means for dc ;ignat~ 

said controller including means for distributing multiple mg which of me disk ^ves P<> said circuit boards fii action 

copies of data from said host to multiple locations on the ^ bascd ^ cachc - 

said array of disk drive storage devices utilizing a least ll ' Apparatus in accordance with claim 5 wherein said 

recently used/fastest fit cache management strategy, controller includes means responsive to a read command 

and means responsive to commands from the host to 30 firom the host for recovering data from the location fn the 

read data from an address in the subsystem by inspect- subsystem corresponding to the said host Specified i ^drcss 

ing the contents of said array of disks to identify the and for storing said data in a plurality of said disks \\f said 

presence of data corresponding to said address, said d *^ ^ ttScd d ^ cache. 

controller further including means for transferring said - ^ Apparatus in accordance with claim 11 where 1 p said 

address corresponding data from said array of disks to controller includes means responsive to a favorable deter- 

the host. minaiion of the presence of requested data in said disl ; based 

2. Apparatus in accordance with claim 1 wherein said ^isk cache for selecting the said disk of said cache whi.^ has 
controller further includes means operable upon a failure to ils rcad be * d * proximity to said requested lata, 
find data in said array of disks corresponding lo said address ^ 13 * Apparatus ^ accordance with claim 12 where; p said 
for recovering the data from the subsystem for both transfer controller includes means for dctennining that said c; >nirol^ 
to the host and storage in a plurality of said disks. ier mUSl P roccss othcr fractions before handling it? host 

3. Apparatus in accordance with claim 1 wherein said read command associated with the said disk selecti r£ and 
controller includes means for sensing the disk in said array including means for determining the total time to 
of rf^W having its read/write head closest to said data handle all said other functions before selecting the said disk 
requested by the host. 45 & avi °g il $ read head in closest proximity to said re< tested 

4. Apparatus in accordance with claim 3 wherein said da V** _ 

controller includes means for determining the time required 14 1116 method of utilizing a mulUdimensiooal a ray 0 f 

for accommodating any previously occurring Operations low cost disks for mterfacmg>>eiween a hw data pro< ^ssing 

with said array of disk s before determining which said disk „ dfivice ^ a ^system capable of storing large quant [ies of 

has its said head closest to the data associated with a given daU comprising toe steps of: 

read request from the host. storing multiple copies of data in multiple locati ns on 

5. Apparatus for interfacing between a host data process- said array of disk drive storagedevices utilizing i least 
ing device and a subsystem capable of storing large quan- recently used/fastest fit cache management srra >gy; 
lilies of data comprising: 55 receiving data exchanging commands from said h : ?st; 

a multidimensional array of low cost disks arranged as a responding to commands from said host to read dat ; ; from 

disk based disk cache; an address in the subsystem by inspecting the agents 

a controller including a programmed processor, said con- of said array of disks to determine whether th } data 

troller being coupled for exchanging data and com- corresponding to said address is present in at lei, >t one 

mands with the host and with the subsystem wherein 60 of said disks in said array; and 

read commands from the host specify an address in the transferring said address corresponding data fro* 1 1 said 

subsystem for identifying requested data; disk to said host in response to a favorable resul from 

means coupling said disk cache with said controller for said inspecting step, 

operating said disks in response to output signals from 15. The method in accordance ; with claim 14 yhich 

said controller; and cs includes the steps of responding to a failure to pro! ucc a 

Raid ooncroller including mcajjs for distributing multiple positive result from said inspecting step by rccoveriiig the 

copies of data from said host to multiple locations on data from the subsystem, transferring said recovered 1 ata to 
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said host, and storing said recovered data in a plurality of 
said disks. 

16, The method in accordance with claim 14 which 
includes the step of selecting the said disk having its 
read/write head closest to said data requested by the host, 
a^ producing to said host the data from said selected disk. 



17. The method in accordance with claim 16 



includes tiie steps of determining the time requiihd for 



accommodating any previously occurring operation] 
said disks prior to performing said selecting step, 



> with 



^hich 
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Applicant Michael Kowalchik, et al RECEIVED APPEAL BRIEF 

U.S.S.N.: l(W3l,622 CENTRAL PAX GIMTIR 

Filing Daie: December 9, 2003 ^ weWTER 

EMC Docket No.: EMC-01-102CIP1 p£g ^ j 

XL CONCLUSION 

For the reasons set forth above, applicants respectfully assert that the rejection of claims 
30-32, 34-43, 45 and 47-50 is unwarranted and improper and that the claims are allowable over 
the art of record. Accordingly, applicants respectfully request removal of the obviousness 
rejection of the claims 30-32, 34-43, 45 and 47-50 over Brandt in view of Rao. 

Respectfully submitted, 

Date Scott A. Ouellette, Esq. 

Reg. No. 38,573 
EMC Corporation 
176 South Street 
Hopkinton, MA 01748 
Telephone: (508) 293-7835 
Facsimile: (508)497-6915 
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